NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Experimenting with Zero-Knowledge Proofs of Training

https://doi.org/10.1145/3576915.3623202

Garg, Sanjam; Goel, Aarushi; Jha, Somesh; Mahloujifar, Saeed; Mahmoody, Mohammad; Policharla, Guru-Vamsi; Wang, Mingyuan (November 2023, ACM)
Effectively Using Public Data in Privacy Preserving Machine Learning

Nasr, Milad; Mahloujifar, Saeed; Tang, Xinyu; Mittal, Prateek; Houmansadr, Amir (January 2023, ICML)

Full Text Available
Overparameterization from Computational Constraints

Garg, Sanjam; Jha, Somesh; Mahloujifar, Saeed; Mahmoody, Mohammad; Wang, Mingyuan. (December 2022, Curran Associates)

Full Text Available
Machine Learning with Differentially Private Labels: Mechanisms and Frameworks

https://doi.org/10.56553/popets-2022-0112

Tang, Xinyu; Nasr, Milad; Mahloujifar, Saeed; Shejwalkar, Virat; Song, Liwei; Houmansadr, Amir; Mittal, Prateek (October 2022, Proceedings on Privacy Enhancing Technologies)

Label differential privacy is a relaxation of differential privacy for machine learning scenarios where the labels are the only sensitive information that needs to be protected in the training data. For example, imagine a survey from a participant in a university class about their vaccination status. Some attributes of the students are publicly available but their vaccination status is sensitive information and must remain private. Now if we want to train a model that predicts whether a student has received vaccination using only their public information, we can use label-DP. Recent works on label-DP use different ways of adding noise to the labels in order to obtain label-DP models. In this work, we present novel techniques for training models with label-DP guarantees by leveraging unsupervised learning and semi-supervised learning, enabling us to inject less noise while obtaining the same privacy, therefore achieving a better utility-privacy trade-off. We first introduce a framework that starts with an unsupervised classifier f0 and dataset D with noisy label set Y , reduces the noise in Y using f0 , and then trains a new model f using the less noisy dataset. Our noise reduction strategy uses the model f0 to remove the noisy labels that are incorrect with high probability. Then we use semi-supervised learning to train a model using the remaining labels. We instantiate this framework with multiple ways of obtaining the noisy labels and also the base classifier. As an alternative way to reduce the noise, we explore the effect of using unsupervised learning: we only add noise to a majority voting step for associating the learned clusters with a cluster label (as opposed to adding noise to individual labels); the reduced sensitivity enables us to add less noise. Our experiments show that these techniques can significantly outperform the prior works on label-DP.
more » « less
Full Text Available
A Separation Result Between Data-oblivious and Data-aware Poisoning Attacks

Deng, Samuel; Garg, Sanjam; Jhan, Somesh; Mahloujifar, Saeed; Mahmoody, Mohammad; Thakurta, Abhradeep. (December 2021, Curran Associates)

Full Text Available
Lower Bounds for Adversarially Robust PAC Learning under Evasion and Hybrid Attacks

Diochnos, Dimitrios I.; Mahloujifar, Saeed; Mahmoody, Mohammad. (December 2020, IEEE)

Full Text Available
Model-Targeted Poisoning Attacks with Provable Convergence

Suya, Fnu; Mahloujifar, Saeed; Suri, Anshuman; Evans, David; Tian, Yuan (January 2021, 38th International Conference on Machine Learning)
null (Ed.)
In a poisoning attack, an adversary with control over a small fraction of the training data attempts to select that data in a way that induces a corrupted model that misbehaves in favor of the adversary. We consider poisoning attacks against convex machine learning models and propose an efficient poisoning attack designed to induce a specified model. Unlike previous model-targeted poisoning attacks, our attack comes with provable convergence to any attainable target classifier. The distance from the induced classifier to the target classifier is inversely proportional to the square root of the number of poisoning points. We also provide a lower bound on the minimum number of poisoning points needed to achieve a given target classifier. Our method uses online convex optimization, so finds poisoning points incrementally. This provides more flexibility than previous attacks which require a priori assumption about the number of poisoning points. Our attack is the first model-targeted poisoning attack that provides provable convergence for convex models, and in our experiments, it either exceeds or matches state-of-the-art attacks in terms of attack success rate and distance to the target model.
more » « less
Full Text Available
Adversarially Robust Learning Could Leverage Computational Hardness

Garg, Sanjam; Jha, Somesh; Mahloujifar, Saeed; Mahmoody, Mohammad. (January 2020, Proceedings of Machine Learning Research)

Full Text Available
Empirically Measuring Concentration: Fundamental Limits on Intrinsic Robustness

Mahloujifar, Saeed; Zhang, Xiao; Mahmoody, Mohammad; Evans, David. (December 2019, Curran Associates)

Full Text Available
Empirically Measuring Concentration: Fundamental Limits on Intrinsic Robustness

Mahloujifar, Saeed; Zhang, Xiao; Mahmoody, Mohammad; Evans, David (January 2019, ICLR Workshop on Debugging Machine Learning Models)

Many recent works have shown that adversarial examples that fool classifiers can be found by minimally perturbing a normal input. Recent theoretical results, starting with Gilmer et al. (2018), show that if the inputs are drawn from a concentrated metric probability space, then adversarial examples with small perturbation are inevitable. A concentrated space has the property that any subset with Ω(1) (e.g., 1/100) measure, according to the imposed distribution, has small distance to almost all (e.g., 99/100) of the points in the space. It is not clear, however, whether these theoretical results apply to actual distributions such as images. This paper presents a method for empirically measuring and bounding the concentration of a concrete dataset which is proven to converge to the actual concentration. We use it to empirically estimate the intrinsic robustness to ℓ∞ and ℓ2 perturbations of several image classification benchmarks.
more » « less
Full Text Available

Search for: All records